LOAD BALANCING SYSTEM AND METHOD IN A MULTIPROCESSOR SYSTEM 



The present invention relates generally to multiprocessor systems and particularly to a method 
of balancing query loads in a multiprocessor system. 

BACKGROUND OF THE INVENTION 

5 

Multiprocessor systems which service a multitude of requests (e.g., queries) from a 
large number of users must use their resources efficiently in order to service the requests 
without undue delay. This is especially true for web servers which receive queries for service 
from multiple users simultaneously. Queries from users maybe for a particular web page or 

10 for a set of search results, in the case where the server is a web crawler server. If there is 
undue delay in servicing the requests, the users may discontinue use of the web server. 

In a multiprocessor system, load balancing is required to distribute received queries 
among a plurality of processors to allow efficient servicing of the queries or other requests. If 
some processors are heavily loaded with queries while other processors are lightly loaded or 

15 remain unused for periods of time, the average time required to service a query can rise 
considerably. 

Prior load balancers known in the art have managed distribution of requests or queries 
in a round robin basis or have directed requests to the processor with the lowest load for 
servicing. The round robin method, however, does not take into account possible variations in 
20 processor load capacity or processing power among the plurality of processors in the system. 
Therefore, over time, processors with a lower load capacity would be heavily loaded while 
processors with a higher load capacity would be lightly loaded or remain unused for periods 
of time. 

The method of directing requests for service to the processor with the lowest load 
25 also has a drawback in that once the processor with the lowest load is identified, it will draw 
the most traffic, an effect known as herding. 

Another method of request distribution is to randomly select a set of processors and 
assign the request to the processor in this set with lowest associated load for servicing. 
However, this approach does not take into account that there may be a preferred choice of 
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processors and this method can expend valuable resources by querying and comparing load 
values of several processors. See for example U.S. Patent No. 5,991,808. 

Various load balancing methods have been used in multiprocessor systems. However, 
there remains a need for a simple and efficient method for balancing the load distributions of 
5 a multiprocessor system which does not unduly expend valuable system resources. 

SUMMARY OF THE INVENTION 

The present invention includes a method for choosing a resource, among a plurality of 
10 resources, for servicing a request, such as query. In accordance with the present invention a 
first resource is randomly selected among the plurality of resources in accordance with a 
predefined first random selection function. Each of the resources has an associated load 
value. The load value of the first resource is compared to a threshold value, and upon 
determining that the first load value does not exceed the threshold value, the request is 
1 5 assigned to the first resource for servicing the request. 

In another aspect of the method of the present invention, when it is determined that 
the first load value exceeds the predetermined threshold value, a second resource is randomly 
selected among the plurality of resources in accordance with a predefined second random 
selection function. The load value of the second resource is compared to the threshold value, 
20 and upon determining that the second load value does not exceed the threshold value, the 
request is assigned to the second resource for servicing the request. 

In yet another aspect of the method of the present invention, when it is determined 
that the second load value exceeds the predetermined threshold value, the first and second 
load values are compared and the request is assigned to one of the first resource and second 
25 resource having a lower associated load value for servicing the request. 

In a preferred embodiment, the resources to which queries are allocated are 
processors, one of whose functions is to process queries. 

The present invention also includes a system for selecting a resource, among a 
plurality of resources, for servicing a request. The system includes one or more interfaces for 
30 receiving a request and for forwarding the request to a selected resource and control logic. 
The control logic is configured to randomly select a first resource among the pluraUty of 
resources in accordance with a predefined first random selection function, the first resource 
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having an associated first load value, compare the first load value to a predetermined 
threshold value to determine whether the first load value exceeds the predetermined threshold 
value, and assign the request to the first resource for servicing the request when the first load 
value does not exceed the predetermined threshold value. The control logic, in some 
embodiments, is implemented as one or more processing units for executing stored 
procedures, and a set of request allocation procedures stored in memory within the system. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Additional objects and features of the invention will be more readily apparent fi-om 
the following detailed description and appended claims when taken in conjunction with the 
drawings, in which: 

FIG. 1 is a block diagram of a multiprocessor system environment. 

FIG. 2 is a block diagram of the query integrator shown in FIG. 1. 

FIGS. 3 A and 3B depicts two examples of the data contained in the processor data 
table shown in FIG. 2. 

FIG. 4 is a flow chart showing a random processor assignment procedure in 
accordance with the present invention. 

FIG. 5 is a flow chart showing an alternative embodiment of the random processor 
assignment procedure in accordance with the present invention. 

DESCRIPTION OF THE PREFERRED EMBODIMENTS 

Query Handling System - General Architecture 

Referring to FIG.l, the environment of a multiple processor system 100 is shown to 
include a plurality of front end processors FE(1) to FE(j) 104, a plurality of query integrators 
QI(1) to QI(m) 200, and a plurality of back end processors P(l) to P(n) 108. Queries 102 are 
initially received by the front end processors 104. Each front end processor 104 operates 
separately and independently of the other fi-ont end processors 104. Similarly, each query 
integrator 200 operates separately and independentiy of the other query integrators 200. 
Additionally, each of the back end processors 108 operate separately and independently of 
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each other. The multiprocessor system 100 maybe a either a homogeneous system in which 
each of the processors have substantially similar load capacity values or processor speeds, or 
a heterogeneous system in which the processors have different load capacity values or 
processor speeds. 

The queries 102 are sent from multiple users to the front end processors 104, which 
randomly direct each individual query 102 to a single query integrator 200 among the 
plurahty of query integrators. The queries may be randomly assigned to the query integrators, 
or may be assigned in roimd robin fashion by each front end processor or in any other manner 
that reasonably distributes the computational load associated with query allocation among the 
back end processors. Upon receiving a query, each query integrator randomly or pseudo 
randomly directs the query to one of the back end processors 108 to service or execute the 
query. Each query integrator uses a random processor assignment procedure of the present 
invention to randomly assign a query to a back end processor 108. 

Query Integrator 

The job of the query integrators 200 is to distribute queries among the back end 
processors so as to balance the computational load among the back end processors. The 
query integrators randomly assign queries among the processors in a manner that is weighted 
in accordance with both the load handling capabilities and the actual computational loads of 
the back end processors. 

Referring to FIG. 2, each query integrator 200 includes a network interface 202, a 
central processing unit (CPU) 204, a memory 208, and an interconnecting bus 206. Queries 
are received from the front end processors 104 via the network interface 202, and the queries 
are also forwarded for processing to the back end processors 108 via the network interface 
202. In another embodiment, different communication interfaces are used for receiving and 
forwarding the queries. 

The memory 208 stores procedures and data structures including a random processor 
assignment procedure 400, a set of hash functions 210, a set of mapping functions 212, an 
update unavailable processors procedure 214, an update load value data procedure 216, and a 
processor data table 218. The processor data table 218 contains a plurality of processor 
profiles 220(1) to 220(N). Each processor profile 220(1) to 220(N) contains a processor 
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identifier 221 and information regarding the processor corresponding to the processor 
identifier. This information preferably includes a load value 222, a load capacity value 224, 
and a processor unavailable flag or indicator 226. The processor data table 218 also includes 
a total load capacity value 250 which is calculated by summing the load capacity values 224 
5 in the processor data table 218. More specifically, the total load capacity value 250 is the 
sum of the load capacity values 224 of the processors, excluding the processors that are 
marked as being unavailable. 

In an alternate embodiment, the CPU 204 and procedures executed by the CPU to 
randomly or pseudo randomly direct queries to the back end processors are performed by 
10 control logic that is implemented in whole or in part using hardwired logic circuitry. From 
another viewpoint, the query integrator's CPU and the procedures it executed to assign 
queries to back end processors comprise a set of control logic. 
, Processor Data Table fields. Within each profile 220, the processor identifier 22 1 

; - ' uniquely identifies a particular processor. The load value 222 of a processor is based on how 
45 long a query must wait before it can be serviced by the processor, i.e., the latency period of a 
processor. Each query integrator 200 automatically updates the load value data for each 
processor at predetermined intervals using the update load value data procedure 216 
- z[ (described below). The load capacity value 224 of a processor is based on the processor's 
I ' processing speed or power. The processors unavailable flag 226 indicates whether or not the 
f20 processor is unavailable, i.e., not in service, and is incapable of servicing any queries. 

Overview of Query Integrator Procedures 

The random processor assignment procedure 400 uses the data described above from 
25 the processor data table 218 and integrates the hash functions 210, the mapping functions 
212, the update unavailable processors procedure 214, the update load value data procedure 
216, and other procediires (as described below in FIG. 4) to randomly assign a processor to a 
received query for servicing the query. 

The set of hash functions 210 contained in memory 208 include three or four distinct 
30 hash functions that are used in different situations to randomly select a processor (as 

described below). Generally, a hash function generates a hash value or hash number from a 
string of text in such a way that it is extremely unlikely that two different strings of text 
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produce identical hash values, hi the present invention, hash functions are used produce a 
hash value R which is a fixed length numeric representation of the received query text: R = 
HashFunction(query). Many suitable hash functions are known to those skilled in the art, 
including checksum functions and fingerprint functions, and thus are not described here, hi an 
5 alternate embodiment, only one hash function is used. 

The set of mapping functions 212 contained in memory 208 include separate and 
distinct mapping functions which are used in different situations to map the hash values 
produced by the hash fiinctions 21 0 to a processor (as described below). In a preferred 
embodiment, the mapping functions apply a modulo function to determine a mapping value X 

10 which is then mapped to a selected processor, hi the preferred embodiment, the mapping 

function takes into consideration the load capacity value of each processor, which weights the 
processor's probability of selection. Therefore, the selection of processors by the mapping 

; functions is pseudo random since selection is weighted by the capacity of the processor giving 

1= ] a processor with a higher capabihty a greater probability of being selected. 

l§ The update imavailable processors procedure 214 contained in memory 208 may be 

' , called by any of the query integrators 200 when a randomly selected processor is determined 
to be unavailable. When the update unavailable processors procedure 214 is called, each 

|:'= processor is queried to determine whether the processor is available or unavailable. The 
unavailable flags 226 for each processor in the processor data table 21 8 is then updated to 

-pi) reflect the new data regarding the availability or unavailability of each processor. A new total 
load capacity value 250 is then calculated by summing the load capacity values 224 of all 
processors excluding any unavailable processors. In a preferred embodiment, the update 
unavailable processors procedure 214 is also executed in the backgroimd at predetermined 
intervals, for example, every 200 ms. 

25 Table 1 contains a pseudo-code representation of the update unavailable processors 

procedure 214. The pseudo-code used in Tables 1-5 uses universal computer language 
conventions. While the pseudo-code employed here has been invented solely for the 
purposes of this description, it is designed to be easily understandable by any computer 
programmer skilled in the art. 
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Table 1 

Update Unavailable Processors Procedure 

update unavailable processors () { 
5 query processors for a.vailability; 

update processor unavailable flags in the processor data table; 
calculate new total load capacity value { 

sum the load capacities of all available processors; 

}; 

1 0 update total load capacity value in the processor data table; 

}; 

End; 

The update load value data procedure 216 contained in memory 208 is used to update 
fl5 load values 222 at predetermined intervals, for example, every 2 ms. The update load value 
: data procedure 216 queries each processor for its current load value and updates the load 

value 222 in the processor data table 21 8 for each processor profile 220(1) to 220(N). The 
^ -i update load value data procedure 216 is performed by the query integrator 200 automatically 
: and is repeated indefinitely. 

-20 Table 2 contains a pseudo-code representation of the update load value data procedure 

216. 

Table 2 
Update Load Data Procedure 

25 update load value () { 

repeat indefinitely { 
do: 

w^ait (T) /* perform update every T ms */ 
query processors for current load values; 
30 update load values in the processor data table; 

}; 

}; 

End; 

35 Processor Data Table Examples 

FIGS. 3 A and 3B show two examples of the data contained in the processor data table 
(218 of FIG. 2). FIG. 3 A depicts exemplary data contained in the processor data table (218 of 
FIG. 2) when no processors are determined to be unavailable. FIG. 3B depicts the exemplary 
40 data shown in FIG. 3A after two processors have been found to be imavailable and the update 
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unavailable processors procedure (214 of FIG. 2) has been called. Thus, the total load 
capacity value 250 (FIG. 2) is the sum of the load capacity values 224 of the processors, 
excluding the processors that are marked as being unavailable. 

As described above, the "load value" fields in the processor data table represent a 
5 processing latency for each processor while the load capacity value for each processor 
represents the total processing power of the processor. 

Random Processor Assignment Procedure 

10 Referring to FIG. 4, a flow chart for the random processor assignment procedure (400 

of FIG. 2) is shown in accordance with the present invention. The random processor 
assignment procedure 400 is used by the query integrator (200 of FIG. 1) to randomly assign 
each received query a back end processor to service the query. After the query integrator 
receives a query from a front end processor (104 of FIG. 1), the query integrator randomly 

15 selects a first selected processor PI using a first hash function and a first mapping function at 
step 402. 

A pseudo-code representation of step 402 is shown in Table 3. The query integrator 
apphes a first hash function HashFunctionl to the query to obtain a hash value Rl . The query 
integrator then applies a first mapping function MFl on hash value Rl to map Rl to a 

20 processor PI. More specifically, the mapping function applies a modulo function on the hash 
value to derive a mapping value XI using the total load capacity value (TC) of the pluraUty of 
resources as the modulus value: XI = Rl modulo TC. Therefore, the obtained mapping 
value XI is between the values 0 and the total load capacity value minus 1. The mapping 
value XI is then mapped to a processor among the plurality of processors. The preferred 

25 method of such a mapping is to sum the load capacities of the processors starting from the 
first processor until the sum exceeds the mapping value XI, and then selecting the processor 
which corresponds to the last load capacity that was summed. This method, in effect, assigns 
a range of mapping values to each processor and then determines the processor whose range 
includes XI. 

30 In an alternate embodiment, a range of mapping values could be computed in advance 

and assigned to each processor, for instance by storing the mapping range in the processor 
data table, and then searching the table for the processor whose range includes XI . 
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Therefore, the selection of the processor PI is pseudo random since the selection is 
weighted by the capacities of the processors, giving a processor with a higher capability a 
greater probability of being selected. By doing so, the method of query load balancing of the 
present invention takes into account that there may be a preferred choice of processors. The 
5 advantage of such an approach is that processors with greater capability are more likely to be 
selected, thereby increasing system performance since the more powerful processors will 
execute the queries faster with a reduced latency period. 



Table 3 

1 0 Randomly Select Processor P 1 - Step 402 

Rl = HashFunctionl (Query) 

/* Apply Mapping function: */ 
15 XI = Rl modulo TC; /* apply a modulo function to the hash value Rl 

with the total load capacity value as the modulus to obtain a mapping value XI 
which is between 0 and (TC-1) */ 

/* PI = Map (XI, processor data table): */ 
20 Y= 0; /* set a summing variable to 0 */ 

P = 0; /* set a processor identifier variable to 0 */ 

Do until Y> XI { 

P = P + 1 ; /* select the next processor identifier */ 
Skip imavailable processors, and adjust P as necessary; 
25 Y = Y + capacity (P); /*sum the load capacity of the processors */ 

}; 

PI = P; /* the first selected processor PI is the processor 
corresponding to the last load capacity summed */ 

30 Next, at step 404 the query integrator checks the first selected processor PI to 

determine if the processor is available or unavailable. If the first selected processor PI is 
determined to be unavailable, the query integrator calls the update unavailable processors 
procedure (214 of FIG. 2), at step 406, to update the processor unavailable flag (226 of FIG. 
2) for each processor and to obtain a new total load capacity value (250 of FIG. 2) which 

35 excludes the load capacities of the unavailable processors. A pseudo-code representation of 
the update unavailable processors procedure is shown in Table 1. 

At step 408, the query integrator then randomly selects a new processor PI using a 
second hash function and the first mapping function. 
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A pseudo-code representation of step 408 is shown in Table 4. The query integrator 
appUes a second hash function HashFunction2 to the query to obtain a hash value R2. The 
second hash function HashFunction2 may be the same hash function as HashFvmctionl or 
may be a different hash function. 
5 The query integrator then applies the first mapping function MFl on hash value R2 to 

map R2 to a new processor PI . More specifically, the mapping function applies a modulo 
function on the hash value to derive a mapping value X2: X2 = R2 modulo TC. Therefore, 
the obtained mapping value X2 is between the values 0 and the total load capacity value TC 
minus 1 . The mapping value X2 is then mapped to a processor among the plurality of 

1 0 available processors. The preferred method of such a mapping is to sum the load capacities 
of the available processors, starting fi:om the first available processor until the sum exceeds 
the mapping value X2, and then selecting the processor which corresponds to the last load 

ff ; capacity that was summed. This method, in effect, assigns a range of mapping values to each 
available processor and then determines the processor whose range includes X2. 

II 

r ' Table 4 

Randomly Select New Processor PI (New PI) - Step 408 

R2 = HashFunction2(Query) 

26 

/* Apply Mapping function: */ 

X2 = R2 modulo TC; /* apply a modulo function to the hash value R2 

with the total load capacity value as the modulus to obtain a mapping value X2 
which is between 0 and (TC-1) */ 

25 

/* PI = Map (X2, processor data table): */ 

Y=0; /* set a summing variable to 0 */ 

P = 0; /* set a processor identifier variable to 0 */ 

Do until Y>X2 { 

30 P = P + 1; /* select the next processor identifier */ 

Skip unavailable processors, and adjust P as necessary; 
Y = Y + capacity (P); /* sum the load capacity of the processors */ 

}; 

PI = P; /* the selected processor PI is the processor 
35 corresponding to the last load capacity summed */ 
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At step 410, the newly selected processor PI is assigned to service the query. The 
random processor assignment procedure 400 then ends at step 412 and the next received 
query begins processing at step 402. 

If the first selected processor PI is determined to be available, at step 404, the query 
5 integrator then compares the first selected processor's load value to a predetermined threshold 
value at step 414. If the query integrator determines (step 414-No) that the first selected 
processor's load value is not greater than the predetermined threshold value, the query 
integrator assigns the query to the first selected processor PI at step 416 and the random 
processor assignment procedure 400 ends at step 418. Otherwise (step 414- Yes), if the query 
10 integrator determines that the load value of the first selected processor is greater than the 

predetermined threshold value, the query integrator applies a third hash fimction and a second 
O mapping fimction to the query to randomly select a second processor P2 at step 420. 
fil The third hash fimction can be the same as the hash fimction used in step 408, since 

J'Js steps 408 and 420 are mutually exclusive in this embodiment. However, the third hash 

function should be different from the first hash function used to select processor P 1 in step 
PI 402. A third hash function, different from the first hash fimction, is used to hash the query 
ri text to improve the chances that the second selected processor will be randomly selected with 
y ^ respect to the first selected processor. Alternately, a different seed value can be used with the 
If! first hash fimction so that the first hash fimction, when apphed to the query text, produces a 
if) random value that is not correlated with the first selected processor. For the purposes of this 
document, when a hash fimction is used with a new seed value, that will be considered to be a 
different hash fimction. A "seed value" in this context is conceptually similar to 
concatenating a seed string to the beginning of the query text that is to be processed by the 
hash function. 

25 The second mapping fimction excludes the first selected processor PI from the set of 

processors that can be selected. 

A pseudo-code representation of step 420 is shown in Table 5. The query integrator 
applies the third hash fimction HashFunctionS to the query to obtain a hash value R3. The 
query integrator then applies a second mapping fimction MF2 on hash value R3 to map R3 to 

30 a processor P2. More specifically, the mapping fimction applies a modulo fianction on the 

hash value to derive a mapping value X2 using an adjusted total load capacity value (TC2) of 
the plurality of resources, excluding processor PI, as the modulus value: X2 = R3 modulo 
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TC2. Therefore, the obtained mapping value X2 is between the values 0 and the adjusted 
total load capacity value TC2 minus 1. The mapping value X2 is then mapped to a processor 
among the plurality of processors, excluding processor PI . The preferred method of such a 
mapping is to sum the load capacities of the processors, excluding processor PI, starting from 
the first processor until the sum exceeds the mapping value X2, and then selecting the 
processor which corresponds to the last load capacity that was summed. This method, in 
effect, assigns a range of mapping values to each processor and then determines the processor 
whose range includes X2. 



Table 5 

Randomly Select Processor P2 - Step 420 
R3 = HashFunctionS (Query) 
/* Apply Mapping function: */ 

TC2 = TC - capacity(Pl) /* total load capacity excluding load capacity of PI */ 
X2 = R3 modulo TC2; /* apply a modulo function to the hash value R3 

with the adjusted total load capacity value as the modulus to obtain a mapping 

value X2 which is between 0 and (TC2-1) */ 

/* P2 = Map2 (X2, PI, processor data table): */ 

Y= 0; /* set a summing variable to 0 */ 

p = 0; /* set a processor identifier variable to 0 */ 

Do until Y>X2 { 

P = P + 1 ; /* select the next processor identifier */ 

Skip unavailable processors, and adjust P as necessary; 
IfP=Pl {P=P+1} /* skip processor PI */ 

Y = Y + capacity (P); /* sum the load capacity of the processors */ 

}; 

P2 = P; /* the selected processor P2 is the processor 

corresponding to the last load capacity summed */ 

Next, at step 422 the query integrator checks the second selected processor P2 to 
determine if the processor is available or unavailable. If the first selected processor P2 is 
determined to be unavailable, the query integrator assigns the previously selected processor 
PI to handle the query at step 424, and then the random processor assignment procedure 400 
ends at step 426. 

If the second processor P2 is available (step 422-No), the query integrator then 
compares the second selected processor's load value, called the second load value, to the 
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predetermined threshold value at step 428. If the query integrator determines that the second 
load value is not greater than the predetermined threshold value, the query integrator assigns 
the query to the second selected processor P2 for service at step 430 and the random 
processor assignment procedure 400 ends at step 432. If the query integrator determines that 
5 the second load value is greater than the predetermined threshold value, the query integrator 
assigns the query to the first or second selected processor, whichever has the lower associated 
load value, for servicing at step 434. The random processor assignment procedure 400 then 
ends at step 436. 

In an alternate embodiment, when the first selected processor is not available (step 
1 0 404- yes), the query is then assigned by selecting the next available processor in the processor 

data table, moving through the table in a predefined direction until an available processor is 
i ; . found. However, this method may overload a processor adj acent (in the processor data table) 
J^! to a unavailable processor since it would receive all of the load from the unavailable 

processor. 

15 To reduce the computational resources used by the query integrators to assign queries 

- ; = to back end processors, the predetermined load threshold used in steps 414 and 428 may be 

set to a high value, which will increase the chance of a query being assigned to the first 
i-1 selected processor and avoiding steps 420 through 436 of the random assignment procedure. 
' - Except when the first selected back end processor is heavily loaded, the hash function and 
^2b mapping fiinctions will not be executed to select a second processor. On the other hand, to 

improve load balancing among the back end processors it may be preferred to set the 

predetermined load threshold to a lower value, which increases the chances of each query 

being assigned to a hghtly loaded back end processor. 

25 Alternate Embodiment of Random Assignment Procedure 

An alternate embodiment 500 of the random assignment procedure is shown in 
FIG. 5. Only the aspects of this embodiment which differ from the embodiment shown in 
FIG. 4 will be described. 

30 In this embodiment, if the first selected processor is unavailable (step 404-yes) and a 

new first processor PI has been selected (step 408), the procedure does not exit at this point. 
Instead, the procedure continues at step 414, via program control path 502. Thus, the load of 
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the new processor PI will be compared with the threshold and a new processor P2 will be 
randomly selected (step 420) if the load of the new processor PI is greater than the threshold 
(step 414-yes). 

A second difference is that if the second selected processor is unavailable (step 422- 
yes), the update unavailable processors procedure (214 of FIG. 2) is called in step 520, and 
then a new processor P2 is selected in step 522. The procedure for selecting the new 
processor P2 is the same as the one for step 420 and is represented in pseudo-code form in 
Table 5. The hashing function used in step 522 may be the same hashing function as the one 
used in step 420, or it may be a different one. However, in general, the hashing function used 
in step 522 should be different from the one used in steps 402 and 408. 

After step 522, the procedure continues at step 428. Thus, the load of the new 
processor P2 will be compared with the threshold and the query will be assigned to the first or 
second selected processor having the lower associated load value (step 434) if the load of the 
new processor P2 is greater than the threshold (step 428-yes). 

Alternate Embodiment of the Mapping Functions 

hi the preferred embodiment described above, the mapping ftmctions apply a modulo 
function to determine a mapping value X which is then mapped to a selected processor. 
An example of a preferred method of such a mapping is to sum the load capacities of the 
processors starting from the furst processor until the sum exceeds the mapping value X, and 
then selecting the processor which corresponds to the last load capacity that was summed. 
This method, in effect, assigns a range of mapping values to each processor and then 
determines the processor whose range includes X. 

In an alternate embodiment, a range of mapping values could be computed in advance 
and assigned to each processor, for instance by storing the mapping range in the processor 
data table, and then searching the table for the processor whose range includes X. 

Both of these embodiments of the mapping fimction uses a probability density 
function to map a randomly selected X to a processor P. The selection probability for each 
available processor corresponds to its load capacity, and these load capacities are mapped into 
a probability density function by the processor data table and the mapping function that reads 
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the processor data table to select a processor. A mathematical description of the probability 
density function is as follows. 

N 

rC = ^ capacity{Pi) 

i=i 

where the index i is used only to identify available processors and N is the number of 
available processors. The "capacityCP;)" represents the selection probability of a processor i. 
TC is the total of the selection probabilities. 

The mapping is therefore obtained by solving for the smallest value of S to satisfy the 
following equation: 

^ capacity{P^) > X 
i=i 

where X is a randomly selected value between 0 and TC-1 . 

The present invention can be used for load balancing in a variety of multiprocessor systems, 
including system handling requests other than queries. The present invention can be 
implemented as a computer program product that includes a computer program mechanism 
embedded in a computer readable storage medium. For instance, the computer program 
product could contain the program modules shown in Fig. 2. These program modules maybe 
stored on a CD-ROM, magnetic disk storage product, or any other computer readable data or 
program storage product. The software modules in the computer program product may also 
be distributed electronically, via the hitemet or otherwise, by transmission of a computer data 
signal (in which the software modules are embedded) on a carrier wave. 

While the present invention has been described with reference to a few specific embodiments, 
the description is illustrative of the invention and is not to be construed as limiting the 
invention. Various modifications may occur to those skilled in the art without departing firom 
the true spirit and scope of the invention as defined by the appended claims. 
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